Soil transmitted (or intestinal) helminths and schistosomes affect millions of children worldwide. There are four species of soil transmitted helminths: Ascaris lumbricoides (roundworm), Necator americanus and Ancylostoma duodenale (hookworms), and Trichuris trichura (whipworm). The five species of schistosomes which affect humans include: Schistosoma (S.) mansoni, Schistosoma japonicum, Schistosoma mekongi, Schistosoma intercalatum (which cause intestinal schistosomiasis) and Schistosoma haematobium (which causes urinary schistosomiasis). Mass deworming is applied widely to reduce the consequences of helminth infection, and there have been numerous studies on the effects of deworming on growth, cognition and learning outcomes in children over the past several decades. Systematic reviews and meta-analyses based on aggregate results of the effect of mass deworming on health and education outcomes are conflicting with some showing benefit (Hall, Hewitt et al. 2008; Croke, Hicks et al. 2016) and others not (Taylor-Robinson, Maayan et al. 2015; Welch, Ghogomu et al. 2017). Debate has ensued about whether these conflicting results are due to the influence of variations in effect across individual-level characteristics such as whether children are infected or not and intensity of infection (Hotez, Molyneux et al. 2007; Bundy, Kremer et al. 2009; Montresor, Addiss et al. 2015) as well as setting characteristics such as the sanitation environment and rapidity of reinfection (Campbell, Nery et al.). Mass deworming for soil-transmitted helminth infection and schistosomiasis is recommended one to four times per year in order to reduce worm burden in the updated World Health Organization in endemic areas, depending on prevalence of worm infection (WHO 2017). These updated WHO guidelines cite the Campbell and Cochrane systematic reviews on deworming which both concluded there were little to no effects of deworming on child welfare outcomes which included growth, anaemia and cognitive outcomes (Taylor-Robinson, Maayan et al. 2015; Welch, Ghogomu et al. 2016). Mass deworming can be applied to school-aged children or whole communities. Selective treatment of infected individuals is rarely done due to the high cost of screening for infection. The drugs used include albendazole, mebendazole, levamisole, ivermectin and piperazine for soil-transmitted helminth infection and praziquantel for schistosomiasis. These drugs are usually provided as pills, are inexpensive and can be administered by schoolteachers or parents. The drugs are considered to have few minor and transient side effects, such as gastrointestinal discomfort, headache, nausea, dizziness, oedema, myalgia and vomiting (WHO 2017). Mass deworming is sometimes accompanied by iron, micronutrient or food supplementation in order to correct nutritional deficiencies that may have been caused by worm infections (Taylor, Jinabhai et al. 2001; Friis, Mwaniki et al. 2003; Nga, Winichagoon et al. 2009; de Gier, Campos Ponce et al. 2014; Rajagopal, Hotez et al. 2014). In addition water and sanitation measures may be implemented with mass deworming to reduce exposure and transmission of infections. Even with heavy infections, the nutritional requirements of intestinal worms relative to their human hosts are small. The harm to child welfare is expected to be caused by three factors: 1) malabsorption, 2) tissue damage and bleeding, and 3) loss of appetite (Crawley 2004). STH infections may cause malabsorption of nutrients in their hosts because of damage to the gastrointestinal surfaces. Hookworm infections are associated with anaemia, thought to be due to hookworm feeding on host tissue and to bleeding when they move from one site to another (Hall, Hewitt et al. 2008). Intestinal infections may also lead to reduced appetite which may negatively influence both anthropometric measures and attention in school. Deworming drugs are over 90% effective at reducing the worm load in individuals, and are expected to reduce the prevalence of worm infection in the community as well as the intensity of infection in individuals (Figure 1). Reducing the prevalence and intensity of infection is expected to improve child nutritional status due to the mechanisms described above of reducing blood loss, reducing damage to gastrointestinal surfaces and improving appetite. Improved nutritional status and appetite are expected to improve attention in school and cognitive outcomes. Some have argued that deworming alone is insufficient to improve child welfare outcomes since the nutritional deficiencies caused by infections must be corrected with food and/or micronutrients (Hall, Hewitt et al. 2008). Logic model Many potential effect modifiers have been described in the literature. Younger children may have a greater impact of deworming since they are smaller in size and the impact of infections may be greater on them (Hall, Hewitt et al. 2008). Girls may benefit less from deworming if they have lower school attendance (thus, not receiving deworming given at school) and if there is preferential distribution of food or other resources at home which could influence child welfare. Children who are stunted for age at three years of age may not be able to benefit as much in terms of growth. Conversely, children who are underweight may benefit more from deworming than those of normal weight (Hall, Hewitt et al. 2008). It is expected that benefits of deworming would only accrue to those who are infected, and those with heavier infection intensity (Hall, Hewitt et al. 2008). Low socioeconomic status is expected to be correlated with other features that expose children to repeat intestinal infections, including those that cause diarrhea, and thus children with lower socioeconomic status may not achieve as much benefit as less poor children. Reinfection is expected to depend on the prevalence and intensity of infection as well as environmental factors such as the water and sanitation environment and hygiene practices in the community. A recent Campbell systematic review and network meta-analysis by members of our team (VW, PT, GAW, EG, ZB), with 47 randomized trials and >1 million children, found little to no overall effect on growth, attention and school attendance (Welch, Ghogomu et al. 2016). With network meta-analysis, we were able to explore the size of effect with different types and frequency of drugs and their combination with food or micronutrients; none of which contributed to larger effects. Our review also did not find larger effects in subgroups of children at the aggregate level across characteristics such as age, baseline nutritional status, prevalence or intensity of infection that have been postulated to be important (Welch, Ghogomu et al. 2016). These analyses were conducted at the study level, rather than using data for each individual child, which limits the power to detect effect modification by individual participant characteristics and also may be subject to ecological bias. This review was therefore unable to identify whether mass deworming was more effective for children with certain characteristics. There was substantial unexplained heterogeneity between studies, with some studies finding larger effects than others, and no single individual-level, setting-level or methodology characteristic explaining this variation. Thus, we concluded that our analysis of effect modifiers was limited by the aggregate level data. Our previous review was conducted using network meta-analysis (NMA), which allowed the comparison of treatments which had not been directly compared in head-to-head trials. NMA also allowed for the assessment of the role of multi-component interventions (such as deworming combined with other parasite control interventions, food or micronutrients). Because there are several drugs used for mass deworming, this allowed the assessment of heterogeneity related to the type of drug, frequency and use of concomitant interventions. Individual participant data (IPD) meta-analysis has been called the “gold standard” in meta-analyses for exploring individual level characteristics and their association with effects (Stewart 1995). Advantages of IPD meta-analysis include improving data quality, enabling standardization of outcomes, clarifying risk of bias, and increasing the power to assess the interaction of participant characteristics with effect size (Stewart, Clarke et al. 2015). Furthermore, IPD analysis can explore the size and direction of differences in effect, thus assessing whether there is a greater benefit for some participants (Early Breast Cancer Trialists’ Collaborative Group 1990). Another advantage of IPD is that they usually require an international collaborative effort, involving trial authors, which may help to identify more relevant trials, and also contribute to an agreed analysis plan and shared understanding of the results. While failure to obtain some datasets may lead to selection bias if there are systematic reasons why some studies do not provide full data, methods have been developed to combine individual participant data with aggregate data (when IPD is not available for some studies) in network meta-analysis (Sutton, Kendrick et al. 2008; Donegan, Williamson et al. 2012). We decided in collaboration with several authors of primary trials that there would be value in conducting an IPD meta-analysis to explore the question of whether mass deworming is more effective for subgroups of children defined by characteristics such as infection intensity or status, age or nutritional status. This understanding could help to develop targeted strategies to reach these children better with deworming and guide policy regarding deworming. The primary objective is to use individual participant data network meta-analysis to explore whether the effects of different types and frequency of deworming drugs as well as their combination with food or micronutrients on anaemia, cognition and growth vary with child-level and environment-level characteristics (see Table 1), specifically: intensity of infection (as assessed by egg count), infection status (including species of worm), age, nutritional status, socioeconomic status and sanitation environment. We report this protocol according to the preferred reporting items for systematic reviews and meta-analyses for protocols (PRISMA-P) (Moher, Shamseer et al. 2015). Results of the review will be reported using the Preferred Reporting items for Systematic Reviews and Meta-analyses of individual patient data (PRISMA-IPD) Statement (Stewart, Clarke et al. 2015). We will include studies which meet the following eligibility criteria: We will include randomized and quasi-randomized trials. For the purpose of determining whether specific individual-level and environment-level characteristics are associated with greater effects of deworming, there is sufficient evidence from over 70 randomized trials with over 100,000 children to include only randomized and quasi-randomized trials. We will include studies reported in abstract form at a conference as well as unpublished studies. We will seek full datasets from all studies and carry out the same methods for data checking and quality for all studies. Children aged six months up to 16 years. We will exclude studies with less than 100 participants because of the time and effort required for each dataset and the information gained from smaller studies will be small compared to larger datasets. We will not exclude studies on the basis of attrition rate from the study. Mass deworming using any drugs for soil transmitted helminths or schistosomes with or without co-interventions such as food, micronutrients, iron or hygiene interventions. Eligible drugs include (but are not limited to) albendazole, praziquantel, levamisole, ivermectin, diethyl carbamazine, pyrantel, piperazine, metrifonate, hycanthone and tetramisole. We will include studies with combined approaches to parasite elimination such as albendazole and praziquantel. Also, because deworming may be used in combination with iron, food or hygiene promotion, we will include studies with multiple component interventions. Studies will be included with placebo, control, or other active interventions (e.g. vitamin A, iron, hygiene promotion) as comparators. As network meta-analysis depends on the assumption of transitivity (that participants could be randomized to any one of the treatments) (Salanti 2012), we will consider two evidence networks of jointly randomizeable interventions of drugs given for two indications. First, we will assess the evidence network of interventions given for soil-transmitted helminths which includes different frequencies of albendazole, mebendazole, levamisole, pyrantel, piperazine, ivermectin and tetramisole with or without micronutrients or food. These are considered jointly randomizable because they are given for the same indication, and many have been compared in multi-arm trials (Salanti 2012). Secondly, we will consider the evidence network of interventions given for schistosomiasis (praziquantel, metrifonate, hycanthone) with or without micronutrients or food. The primary health outcomes are change from baseline in: weight (kg), height (cm), serum ferritin, cognition and hemoglobin (g/L). We will include studies which measure weight, hemoglobin, serum ferritin, cognition or height. Cognition may be measured using scales that measure development (e.g. Raven's matrices) or tests that assess attention using digit recall. We will not exclude on the basis of reported outcomes since some measured outcomes may not be reported in trial reports or abstracts. We will not assess distal outcomes for this review because there is too little data available so an IPD NMA would not add to the literature. We will use the available data on age and sex to calculate height for age, weight for age and weight for height for children <5 years using the 2006 child growth standards (using WHO software Anthro version 3.2.2) and body mass index (BMI) for age for children aged five or older using the WHO Reference 2007 (using WHO AnthroPlus software). Effects on infection intensity and status will be assessed as secondary outcomes. Adverse effects of deworming were assessed in prior systematic reviews as minor and uncommon and the results are not contested, thus we will not assess adverse effects in this review. Since the primary objective of this systematic review is to assess effect modification, particularly as it relates to infection status and intensity, we will exclude studies that do not measure baseline infection prevalence of at least one of the soil-transmitted helminths or schistosomes. For weight and height, we will include data from studies >4 months in duration because we consider this as a minimum duration to observe differences in growth. However, for hemoglobin and ferritin status, changes may occur sooner, so study duration will not be used as an exclusion criterion. While infection status and infection intensity are affected much sooner than this, these are not primary outcomes of interest since there is no question that deworming drugs reduce infection load. We will assess infection intensity and status at baseline as indicators of the force of infection in the population. We will collect data at each available time-point and explore study duration as a covariate in a meta-regression model, if possible. The settings will include any area where soil-transmitted helminths or schistosomes are endemic. We will use the same search strategy as used for a previous Campbell review by members of our team (Welch, Ghogomu et al. 2016). This search was last run on January 14, 2016. See search strategy in Appendix. We will search in the following databases: MEDLINE, CINAHL, LILACS, EMBASE, the Cochrane Library, Econlit, Internet Documents in Economics Access Service (IDEAS), Public Affairs Information Service (PAIS), Social Services Abstracts, Global Health CABI and CAB Abstracts. Grey literature databases will include thesis dissertations and the System for Information on Grey Literature in Europe (SIGLE)-ends in 2005). We will search websites of relevant organizations such as the World Bank, World Food Program and International Food Policy Research Institute, as per the prior Campbell review (Welch, Ghogomu et al. 2016). We will also contact authors of studies and members of our advisory board for any unpublished studies or grey literature reporting eligible studies. We will check reference lists of relevant studies and reviews. Titles and abstracts will be screened in duplicate by two reviewers. We will pilot-test the screening criteria at both title and abstract screening stage and full text stage. We will use the PRISMA flow diagram to report eligibility of studies. We will retrieve full text of all studies which pass this first level screening. The full text review will also be done in duplicate by two reviewers, and agreement will be reached by consensus. Disagreements will be resolved by consultation with a third reviewer. No language limits will be applied. The research team has expertise in English, French and Spanish, and translation will be sought if studies are found in other languages. Randomized controlled trials of deworming include two-arm trials as well as factorial trials, with children allocated either individually or by cluster-randomization (e.g. by village or school). Details of the populations, interventions, comparators, outcomes and study design will be extracted in duplicate by two reviewers, using a pre-tested form, designed for a previous Campbell review on deworming for children (Welch, Ghogomu et al. 2016). This extraction includes details about the context, setting and environment, as well as sociodemographic details, and details about the frequency, delivery method and dose of interventions. Two independent reviewers will appraise each study with the Cochrane risk of bias tool which assesses selection bias, performance bias, detection bias, attrition bias and reporting bias (Cochrane Handbook) (Higgins, Altman et al. 2011). Disagreements will be resolved by discussion or consultation with a third reviewer. We will appraise the GRADE certainty for each outcome for each comparison by two independent reviewers, using the GRADE approach for network meta-analysis (Puhan, Schünemann et al. 2014). GRADE certainty (quality) “reflects our confidence that the estimates of the effect are correct. In the context of recommendations, quality reflects our confidence that the effect estimates are adequate to support a particular recommendation. “Quality as used in GRADE means more than risk of bias and so may also be compromised by imprecision, inconsistency, indirectness of study results, and publication bias” (Balshem, Helfand et al. 2011). The two reviewers will discuss ratings and reach consensus. Disagreements will be resolved by consulting a third reviewer. We will develop a summary of findings table for each main comparison to show the effects for the outcomes of weight, height and haemoglobin, along with the quality of evidence (using GRADE certainty). Data will be prepared into a flat spreadsheet with the same fields for every study. We will consider the missing values for each variable as missing at random (MAR). We will use multiple imputation to impute the missing values for baseline and outcome variables using Proc MI in SAS/STAT(SAS Institute Inc., Cary, NC, USA). Descriptive characteristics of each study will be presented, with details on the child characteristics, environment, worm species, prevalence, and intensity of infection, geographic location, interventions, comparator and outcomes and risk of bias assessment. We expect to have data from individually randomized trials and cluster-randomized trials. We will account for clusters (such as villages, schools or households) as nested within each study. We will analyse IPD datasets to check for comparability with the primary published papers. We will calculate the standardized difference between the published data and the IPD received from authors for baseline characteristics and baseline outcome assessment. For endline, we will replicate the effect measures reported in study publications and calculate the standardized difference between the IPD received and the study report (Austin 2009). Any differences larger than a standardized difference of 0.2 (chosen because it represents a small effect size) will be discussed with authors to attempt to resolve differences. If the reason for differences cannot be explained, the data will not be used or will only be used in sensitivity analyses. As with our previous Campbell review, we will use a two-step process to meta-analysis. We will first conduct pairwise analyses for each comparison of interest by entering all IPD data into a multilevel model, with each study as one cluster. We expect considerable heterogeneity between studies for each outcome based on our Campbell review; therefore, we will use a random effects model. We will assess mean differences in change from baseline for weight (kg), height (cm), serum ferritin (mcg/l), and haemoglobin (g/l). For cognition, we will analyse measures of motor and cognitive development separately. We will analyse measures of attention separately from developmental outcomes. We expect that cognition will be measured using different scales. We will not combine different measures of cognition. If IPD are not available for all trials (as we expect), we will compare study characteristics and the effect sizes from aggregate data of studies which do not provide IPD to the studies which do provide IPD. We will account for clustering as above by nesting clusters within studies. We decided on a set of pre-defined covariates with advice from our advisory board and co-authors. We will account for the covariates of sex, age, infection intensity for each type of agent, socioeconomic status, maternal education and baseline nutritional status in the model. We will assess heterogeneity using visual inspection of forest plots for pairwise analyses as well as statistical tests of heterogeneity (I2). We will conduct network meta-analysis with IPD, using a frequentist approach for random-effects network meta-analysis. The covariates were identified by the Study Advisory Group, namely: age, sex, baseline nutritional status (weight and height), haemoglobin, and infection intensity. The effect estimate chosen is the mean difference for the continuous outcome of interest. The response variables (weight, height and haemoglobin) follow the normal distribution, and the generalized linear model will be used to fit data to determine the parameter estimate.. Random effect GLMM will be conducted with two random effects considered in the model: random effect ‘trial’ accounts for the response variables of patients within a given trial being correlated; and random effect ‘trial*treatment’ accounts for the correlation of responses between any two patients from the same treatment arm within a given trial, subject to sufficient data available. We expect to have a connected network of trials to allow direct and indirect comparisons based on our Campbell review and network meta-analysis (see Figure 2 for evidence network diagram for weight) (Welch, Ghogomu et al. 2016). We will use the GLIMMIX procedure in SAS/STAT (SAS Institute Inc., Cary, NC, USA) for the GLMM network meta-analysis, considering models that account for multi-arm trials and adjust for the covariates identified. Results will be summarized as point estimates with 95% confidence intervals. Evidence Network for weight (n=29 randomized trials, 61,857 participants) We will compare heterogeneity of participant characteristics and trial methodology in tables. Within GLM, the explanatory model will include covariates at the study level and patient characteristics (anaemia, nutritional status, infection intensity, age, sex). We will construct forest plots for treatment comparisons (adjusted using the same covariates as the base case model) and assess heterogeneity by visual inspection and homogeneity statistics. Any environment level or participant-level covariates that are statistically significant will be analyzed using subgroup analyses. Transitivity cannot be assessed statistically. With IPD, we have more opportunity to account for and model heterogeneity. As proposed by Salanti 2012, we will use IPD to assess the distribution of the child-level effect modifiers from Table 1 in each comparison to assess the plausibility of the transitivity assumption (Salanti 2012). As above, transitivity is considered plausible since the treatments in each model (STH and schistosomiasis, respectively) are provided for the same indication and many of the treatments and their cointerventions have been included in multi-arm trials (Welch, Ghogomu et al. 2016). For this IPD NMA, we assume equal variances across comparisons within network. This assumption will be tested using the Levene test (Gastwirth, Gel et al. 2009). Inconsistency in an NMA is defined as a disagreement between the direct estimates (from direct comparisons of treatments) and indirect estimates (which are derived from the network comparisons). We will assess underlying assumptions including consistency using back-calculation technique (Dias, Welton et al. 2010; Wilson, Tanner-Smith et al. 2016) between indirect and direct evidence and will use model diagnostics, including diagnostic plots (e.g. residual plots), to assess model convergence. We will use node-splitting (Dias, Welton et al. 2010; Dias, Welton et al. 2013) to assess consistency of the NMA. These statistical tests will be interpreted with caution since they are underpowered and their power varies with the heterogeneity in the pairwise comparisons. A funnel plot will be plotted for comparisons and outcomes with >10 studies. We will use Egger's test for asymmetry and visual inspection to assess the presence of publication bias and/or selective reporting. We do not plan to rank interventions because there is controversy as to the utility of ranking. Subgroup analyses: Provided sufficient data is available to inform the evidence network, meta-regression and/or sub-group analyses will be conducted to assess effects across both child-level as well as environment-level characteristics. We will compare the results of models with subgroup analyses by assessing the size of quantitative or qualitative differences in effects, the statistical significance of tests for interactions, assessing between-study variance and assessing the goodness of fit of the models using the likelihood ratio. The following child and environment level effect modifiers will be assessed: Before conducting subgroup analyses, we will assess the distribution of each variable. If there are insufficient children in some categories, the levels may be combined. If possible, we will also assess socioeconomic status of household or parents and maternal education as effect modifiers. As noted in Table 1, environment level characteristics will not be entered into the model. They will be assessed by sensitivity analyses We expect poor reporting on these details in the articles based on our prior Campbell review, but some studies may have collected information on this at the study level that were not reported in the paper publications. We will also assess whether there is sufficient data on the geographic location and date of the studies to assess study-level prevalence generated by the Global Atlas of Helminth Infections (GAHI). Sensitivity analyses Provided sufficient data is available to inform the evidence network, we will conduct sensitivity analyses to assess robustness of results when restricted to studies at low risk of bias for sequence generation, allocation concealment and blinding of participants. We will assess whether results are robust to excluding imputed data (i.e. complete case analysis). We will assess sensitivity to restricting to studies published in 2008 or later (last 10 years). Data will be housed at a secure data warehouse at the Bruyère Research Institute, following the personal health information act. Data will be transferred to SAS as a common platform for all studies, using a common data dictionary. VW will check IPD data for consistency immediately upon receiving datasets. For example, we will check for outlier individuals (e.g. with ages outside of eligibility criteria, duplicate participant IDs, unrealistic date ranges). We will compare the IPD from authors with the aggregate data reported in the articles. Any missing or unusual data will be flagged for discussion with the trial author or statistician by VW. We will ask for clarification from the authors to establish reasons for the errors, and correct them if possible. Any requests for authors will be discussed when the data is provided, such as clarification of trial risk of bias, conduct or eligibility criteria. We will also run the same statistical analysis as the authors to check for consistency with the published paper (Stewart, Clarke et al. 2015). We will request statements of ethics approval from each study and we will not include data from studies that did not receive ethics approval. We will request that all data be transferred without any identifiers. We do not plan to include qualitative research. We developed a comprehensive search strategy with support from an information scientist (JM) for electronic databases and grey literature sources such as organizations active in deworming. The draft search strategy underwent review with PRESS (Peer Reviewed Electronic Search Strategies) (Sampson 2009) by John Eyers, information scientist of the Campbell International Development Group, and appropriate changes were made (Appendices A and B) to produce the finalised search strategy. We identified relevant studies to